FastqPuri: high-performance preprocessing of RNA-seq data
نویسندگان
چکیده
منابع مشابه
Optimization of miRNA-seq data preprocessing
The past two decades of microRNA (miRNA) research has solidified the role of these small non-coding RNAs as key regulators of many biological processes and promising biomarkers for disease. The concurrent development in high-throughput profiling technology has further advanced our understanding of the impact of their dysregulation on a global scale. Currently, next-generation sequencing is the ...
متن کاملInvestigating the Function of Predicted Proteins from RNA-Seq Data in Holstein and Cholistani Cattle Breeds
This study was performed to determine the digital expression profile of different genes expressed in Holstein and Cholistani breeds as well as to evaluate the performance of predicted proteins derived from differentially expressed genes between these two breeds using RNA-Seq data. For this purpose, the whole mRNA sequence for a blood sample of American Holstein and Pakistani Cholistani cattle p...
متن کاملButter: High-precision genomic alignment of small RNA-seq data
Eukaryotes produce large numbers of small non-coding RNAs that act as specificity determinants for various gene-regulatory complexes. These include microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). These RNAs can be discovered, annotated, and quantified using small RNA-seq, a variant RNA-seq method based on highly parallel sequencing. Alignment t...
متن کاملTBX20 RNA-Seq data subset
The TBX20 data set [4] provides ChIP-Seq and RNA-Seq data. In here only the RNA-Seq part of the data is utilized. The raw data where downloaded from Gene Expression Omnibus (GEO) [1], accession number GSM767225GSM767230. TBX20 (T-box 20) in general is a transcriptional regulator essential for cardiac development and maintenance of mouse heart tissue. In this study TXB20 was knocked-out by using...
متن کاملSam2bam: High-Performance Framework for NGS Data Preprocessing Tools
This paper introduces a high-throughput software tool framework called sam2bam that enables users to significantly speed up pre-processing for next-generation sequencing data. The sam2bam is especially efficient on single-node multi-core large-memory systems. It can reduce the runtime of data pre-processing in marking duplicate reads on a single node system by 156-186x compared with de facto st...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Bioinformatics
سال: 2019
ISSN: 1471-2105
DOI: 10.1186/s12859-019-2799-0